CLAIMS 

What is claimed is: extracellular matrix, 

1 A method of inhibiting the ability of a cell 

. ■ u ♦••«,, nf 99437 Drotein expressed by the cell, whereby 
the method comprising inhibiting the activity of 22437 protein p 

5 the ability of the cell to degrade the extracellular matrix is inhibited. 

2 Themethodofclaiml,whereintheactivityof22437 P roteinexpres S edb^ 
the cell is inhibited by inhibiting expression of the 22437 gene in the cell. 
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3 The method of claim 2, wherein expression of .he 22437 gene is inhibited by 

conditions with a transcript of the 22437 gene. 

4. The method of claim 3, wherein the antisense ohgonucleotide comprises a, 
least 15 nucleotide residues. 

5. The method of claim 3, wherein the transcript is an mRNA. 

6 The method of claim 2, wherein expression of the 22437 gene is inhibited hy 
conditions with a polynucleotide having the nucleotide sequence SEQ ID NO: 1. 



7 The method of claim 2, wherein expression of the 22437 gene is inhibited by 
25 conditionswimapolynucleotidehavingthenucleotidesequenceSEQIDNO^. 

8 The method of claim 1. wherein the activity of 22437 protein expressed by 
tKe eel, is irmibited by inhibiting a cataiytic activity of 22437 protein without significantly 

affecting 22437 gene expression in the cell. 

30 
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9. The method of claim 1, wherein the activity of 22437 is inhibited by 
administering to the cell an agent which inhibits an activity of 22437 protein. 

10. The method of claim 9, wherein the agent is an antibody which specifically 
5 binds with 22437 protein. 

11. The method of claim 9, wherein the activity is sulfatase activity. 

12. The method of claim 9, wherein the activity is ability to degrade an 
10 extracellular matrix. 

n 13. The method of claim 1 , wherein the cell is a tumor cell. 



14. The method of claim 13, wherein the tumor cell is selected from the group 
5 1 5 consisting of a colon tumor cell, an ovarian cancer cell, a breast cancer cell, a lung cancer cell, 
%j and a glioblastoma cell. 

E3 1 5 The method of claim 1 , wherein the cell is a vascular endothelial cell. 

□ 

w 

20 1 6. The method of claim 1 , wherein the cell is a neuronal cell. 

17. The method of claim 16, wherein the neuronal cell is selected from the 
group consisting of an astrocyte, a neuron of the cerebral cortex, and a neuron of the 
hypothalamus. 

25 

18. The method of claim 1, wherein the cell is in the body of a human. 

19. A method for assessing whether a test compound is useful for modulating at 
least one phenomenon selected from the group consisting of tumor establishment, tumor 
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growth, .umor metastasis, epithelial ce.1 proliferation, endothelial cel. proliferation, neuronal 
cell growth, wound healing, and cerebral injury healing, the method comprising: 

a) adding the test compound to a first composition comprising a polypeptide that has an 
ammo acid sequence a, leas, 80% identical to SEQ ID NO: 2 and that exhibits a 22437 activity 

5 and; . 

b) comparing the 22437 activity in the first composition and 22437 activtty m a second 

composition that is subsumtiany identical to the first, except that it does no. comprise the test 

compound, 

whereby a difference in 22437 activity in the first and second compositions is an indicate that 
10 the test compound is useful for modulating the phenomenon. 

20. The method of claim 19, wherein the activity is selected from the group 
consisting of sulfatase activity and ability to degrade an extracellular matrix. 



21. The method of claim 19, wherein the protein has the amino acid sequence 
N SEQ ID NO: 2. 



15 



0 22. The method of claim 19, wherein the first composition comprises a cell 

5 comprising a nucleic acid encoding the protein. 
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23. The method of claim 22, wherein the nucleic acid is the genome of the cell. 

24. The method of claim 22, wherein the nucleic acid comprises the 22437 gene 

25. A method for assessing whether a test compound is useful for modulating at 
least one phenomenon selected from the group consisting of tumor establishment, tumor 
growth, tumor metastasis, epithelial cell proliferation, endothelial cell proliferation, neuronal 
cell growth, wound healing, and cerebral injury healing, the method comprising: 
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a) adding the test compound to a first composition comprising a cell which comprises a 
nucleic acid that encodes a polypeptide that has an amino acid sequence at least 80% identical 
to SEQ ID NO: 2 and that exhibits a 22437 activity and; 

b) comparing 22437 activity in the first composition and 22437 activity in a second 
5 composition that is substantially identical to the first composition, except that it does not 

comprise the test compound, 

whereby a difference in 22437 activity in the first and second compositions is an indication that 
the test compound is useful for modulating the phenomenon. 

10 26. A method of making a pharmaceutical composition for modulating at least 

one phenomenon selected from the group consisting of tumor establishment, tumor growth, 
P tumor metastasis, epithelial cell proliferation, endothelial cell proliferation, neuronal cell 
| growth, wound healing, and cerebral injury healing, the method comprising: 
IS a) selecting a test compound useful for modulating the phenomenon according to the 

w 

rU 15 method of claim 19; and 

b) combining the test compound with a pharmaceutical^ acceptable carrier in order to 

make the pharmaceutical composition. 
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■j 27. A method of modulating, in a human, at least one phenomenon selected 

from the group consisting of tumor establishment, tumor growth, tumor metastasis, epithelial 
cell proliferation, endothelial cell proliferation, neuronal cell growth, wound healing, and 
cerebral injury healing, the method comprising administering the pharmaceutical composition 
of claim 26 to the human in an amount effective to modulate the phenomenon. 



25 



28. A method for identifying a compound useful for modulating at least one 
phenomenon selected from the group consisting of tumor establishment, tumor growth, tumor 
metastasis, epithelial cell proliferation, endothelial cell proliferation, neuronal cell growth, 
wound healing, and cerebral injury healing, the method comprising: 

a) contacting the test compound and a polypeptide selected from the group consisting 

30 of 
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i) a polypeptide which is encoded by a nucleic acid molecule comprising 
a portion having a nucleotide sequence which is at least 60% identical to either 

of SEQ ID NOs: 1 and 3; and 

ii) a fragment of a polypeptide having either an amino acid sequence 
5 comprising SEQ ID NO: 2, wherein the fragment comprises at least 1 5 

contiguous amino acid residues of SEQ ID NO: 2 
or a cell that expresses the polypeptide; and 

b) determining whether the polypeptide binds with the test compound, 
whereby binding of the polypeptide and the test compound is an indict 

10 compound is useful for modulating the phenomenon. 

29 The method of claim 28, wherein the polypeptide exhibits an activity 
selected from the group consisting of sulfatase activity and abUity to degrade an extracellular 
matrix. 



□ 

[U 

^ 15 30. The method of claim 28, wherein the polypeptide exhibits an epitope in 
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common with a polypeptide having the amino acid sequence SEQ ID NO: 2. 
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SEQUENCE LISTING 

<110> GLUCKSMANN, Maria A. 
MEYERS, Rachel 
KAPELLER-LIBERMANN, Rosana 
SILOS- SANTIAGO, Inmaculada 

< 12 0> 22437, A NOVEL HUMAN SULFATASE AND USES THEREFOR 
<130> 10147-61U1 

<140> NOT YET ASSIGNED 
<141> 2001-10-03 

<150> US 60/257,082 
<151> 2000-12-21- 

<160> 12 

<170> Patentln Ver. 2.1 



H <210> 1 
O <211> 3513 
fU <212> DNA 

<213> Homo sapiens 



a. I 



=«.«=«, tcgt «c„ 5 ^ r:rr - : iiizzz r 20 

jJ a« t atcacg atatttgatt tattagcgat ,= £gtg ^ 

» J , cgtgcacaca aggctctggc tcgcttccet ccctcgtt * atctqcgagt 240 

<& catctgtttc aactctccgc cgagggcga, caggagcgag « ^ 

S — ' — '""el "= c X 9 =tgtgc tt g 3,0 

aaagaagcac cagatcagca aaaaaagaag atggg rcttcctqtc gcaccaccgc 420 

ctgtccgcaa ctgtgttctc cctgctgggt ggaagctcg ccttcctjtc g ^ 

ctgaaaggca ggtttcagag ggaccgcagg a * cat ^c ° ccggcgcatc 540 

acggacgacc aggatgtgga gc = g c tgaaca^ gg^ ^ 

atggagcagg gcgggacgca cttcatcaac ^ acaaca ccta caccaacaat 660 
tcacgctcct ccatcctcac cggcaagtac gtccacaacc * tgccgtgtac 720 

ga gaactgct cctcgccctc ctggcaggca cagca ccgcac ^ 

ctcaatagca ctggctaccg gacagctttc ttcggg g ccgcttttat 840 

tcctacgtgc cacccggctg gaaggagtgg gtcggactcc "aaaaactc J ^ 

aactacacgc tgtgtcggaa cggggtgaaa gagaagcac, ctcc c ^ 

tacctcacag acctcatcac caatgacagc gtgagcttct t«*cacg^ J ^ 

tacccgcaca ggccagtcct catggtcatc agccatgcag cccccca gg * ^ 

=: = =; = = =: - 
;:::r- :;:= „ uw 




tacatcgtat acaccgccga ccacggttac cacatcggcc agtttggcct ^gaaaggg 1320 
aaatccatgc catatgagtt tgacatcagg gtcccgttct acgtgagggg -c 
gaagccggct gtctgaatcc ccacatcgtc ctcaacattg acctggcccc ^ccatcctg 1440 
gacattgcag gcctggacat acctgcggat atggacggga aatccatcct caagctgctg 500 
gacacggagc ggccggtgaa tcggtttcac ttgaaaaaga agatgagggt ctggcgggac 5 0 
fccttcttgg tggagagagg caagctgcta cacaagagag acaatgacaa ggtggacgcc 
caggaggag. actttctgcc caagtaccag cgtgtgaagg acctgtgtca gcgtgctgag 
taccagacgg cgtgtgagca gctgggacag aagtggcagt gtgtggagga cgccacgggg 
aagctgaagc tgcataagtg caagggcccc atgcggctgg gcggcagcag agccctctcc 1800 
aacctcgtgc ccaagtacta cgggcagggc agcgaggcct gcacctgtga cagcggggac 1860 
tacaagctca gcctggccgg acgccggaaa aaactcttca agaagaagta caaggccagc 1920 
tatgtccgca gtcgctccat ccgctcagtg gccatcgagg tggacggcag ggtgtaccac 9 0 
gtaggcctgg gtgatgccgc ccagccccga aacctcacca agcggcactg gccaggggcc 040 
cctgaggacc aagatgacaa ggatggtggg gacttcagtg gcactggagg ccttcccgac 00 
tactcagccg ccaaccccat taaagtgaca catcggtgct acatcctaga gaacgacaca 2160 
gtccagtgtg acctggacct gtacaagtcc ctgcaggcct ggaaagacca caagctgcac 2220 
atcgaccacg agattgaaac cctgcagaac aaaattaaga acctgaggga agtccgaggt 2280 
cacctgaaga aaaagcggcc agaagaatgt gactgtcaca aaatcagcta ccacacccag 2340 
cacaaaggcc gcctcaagca cagaggctcc agtctgcatc ctttcaggaa gggcctgcaa 2400 
gagaaggaca aggtgtggct gttgcgggag cagaagcgca agaagaaact ccgcaagctg 2460 
ctcaagcgcc tgcagaacaa cgacacgtgc agcatgccag gcctcacgtg cttcacccac 2520 
gacaaccagc actggcagac ggcgcctttc tggacactgg ggcctttctg tgcctgcacc 
agcgccaaca ataacacgta ctggtgcatg aggaccatca atgagactca caatttcctc 2640 
ttctgtgaat ttgcaactgg cttcctagag tactttgatc tcaacacaga cccctaccag 2700 
ctgatgaatg cagtgaacac actggacagg gatgtcctca accagctaca cgtacagctc 2760 
atggagctga ggagctgcaa gggttacaag cagtgtaacc cccggactcg aaacatggac 2820 
ctgggactta aagatggagg aagctatgag caatacaggc agtttcagcg tcgaaagtgg 2880 
ccagaaatga agagaccttc ttccaaatca ctgggacaac tgtgggaagg ctgggaaggt 2940 
taagaaacaa cagaggtgga cctccaaaaa catagaggca tcacctgact gcacaggcaa 3000 
tgaaaaacca tgtgggtgat ttccagcaga cctgtgctat tggccaggag gcctgagaaa 3060 
gcaagcacgc actctcagtc aacatgacag attctggagg ataaccagca ggagcagaga 3120 
taacttcagg aagtccattt ttgcccctgc ttttgctttg gattatacct caccagctgc 3180 
acaaaatgca ttttttcgta tcaaaaagtc accactaacc ctcccccaga agctcacaaa 3240 
ggaaaacgga gagagcgagc gagagagatt tccttggaaa tttctcccaa ^gaaagt 3300 
cattggaatt tttaaatcat aggggaaaag cagtcctgtt ctaaatcctc ttattctttt 3360 
ggtttgtcac aaagaaggaa ctaagaagca ggacagaggc aacgtggaga ggctgaaaac 3420 
agtgcagaga cgtttgacaa tgagtcagta gcacaaaaga gatgacattt acctagcact 3480 
ataaaccctg gttgcctctg aagaaactgc ctt 



<210> 2 
<211> 870 
<212> PRT 

<213> Homo sapiens 
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Met Gly Pro Pro Ser Leu Val Leu Cys Leu Leu Ser Ala Thr Val Phe 
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Ser Leu Leu Gly Gly Ser Ser Ala Phe Leu Ser His His Arg Leu Lys 
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Gly Arg Phe Gin Arg Asp Arg Arg Asn He Arg Pro Asn lie lie Leu 
35 40 45 

Val Leu Thr Asp Asp Gin Asp Val Glu Leu Gly Ser Met Gin Val Met 

55 60 



50 



Asn Lys Thr Arg Arg I 



le Met Glu Gin Gly Gly Thr His Phe lie Asn 



65 



70 
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80 



Ala Phe Val Thr Thr Pro Met Cys Cys Pro Ser Arg Ser Ser lie Leu 



85 

Thr Gly Lys Tyr Val His 
100 



90 



Asn His Asn Thr Tyr Thr Asn Asn Glu Asn 
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110 



% t Cys Ser Ser Pro Ser 



Trp Gin Ala Gin His Glu Ser Arg Thr Phe Ala 



115 



120 



125 



p r i Val Tyr Leu Asn 



Ser Thr Gly Tyr Arg Thr Ala Phe Phe Gly Lys Tyr 
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135 



140 



Leu Asn Glu Tyr Asn Gly Ser Tyr Val Pro Pro Gly Trp Lys Glu Trp 
145 150 155 
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2 Val Gly Leu Leu Lys 



Asn Ser Arg Phe Tyr Asn Tyr Thr Leu Cys Arg 
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0 Asn Gly Val Lys Glu Lys His Gly Ser Asp Tyr Ser Lys Asp Tyr Leu 



180 



185 



Thr Asp Leu lie Thr Asn Asp Ser Val Ser Phe Phe Arg Thr Ser Lys 



195 

Lys Met Tyr Pro His Arg Pro 
210 215 



200 



205 



Val Leu Met Val He Ser His Ala Ala 
220 



Pro His Gly Pro Glu Asp Ser Ala Pro Gin Tyr Ser Arg Leu Phe Pro 

235 240 



225 



230 



Asn Ala Ser Gin His He Thr Pro Ser Tyr Asn Tyr Ala Pro Asn Pro 
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255 



Asp Lys His Trp He Met Arg Tyr Thr Gly Pro Met Lys Pro He His 
260 265 270 



ri Phe Thr Asn Met Leu Gin Arg Lys Arg Leu Gin Thr Leu Met 
Met Glu Phe Thr asn uei. u 2g5 

275 280 

c Mat- Clu Thr He Tyr Asn Met Leu Val Glu Thr 
Ser Val Asp Asp Ser Met Glu Thr y 

290 

* 7\ n Thr Tvr He Val Tyr Thr Ala Asp His Gly Tyr 
Gly Glu Leu Asp Asn Thr Tyr lie * 32Q 

305 JiU 

riw Tuq Ser Met Pro Tyr Glu 
His lie Gly Gin Phe Gly Leu Val Lys Gly Lys ^ 

325 " u 
eh. Asp He Aro VI P» «- Tyr val Ar, Gl» Pro «■ Val Glu Ala 



340 



345 



Gly Cys 



L ,u Asn Pro His U. Val Leu Asn Us Asp Leu Ala Pro Thr 
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360 
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lie Leu Asp He Ala Gly Leu 
370 375 



Asp He Pro Ala Asp Met Asp Gly Lys 



380 



~'t T Thr- Glu Arq Pro Val Asn Arg Phe His 

0 ser He Leu Lys Leu Leu Asp Thr Glu Arg 4Qo 

f y onr> 395 



so 



385 



390 



- ^ Qor - php Leu Val Glu Arg 
L eu Lys Lys Lys Met Arg Val Trp Arg Asp Ser 

405 410 



p ely L ys Leu Leu «i» Lys Aro Asp Asn ^ Asp Ala Gin Glu 
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420 

Glu Asn Phe Leu Pro Lys 
435 



Tyr Gin Arg Val Lys Asp Leu Cys Gin Arg 

445 



440 



Al, Glu Tyr Gin Thr Ala Cys Glu Gin Leu Gly Gin Lys Trp Gin Cys 

450 4 " 
».l Glu Asp Ma Thr Gly Lys Leu Lys Leu His Lys Cys Lys Gly Pro 



465 
Met 



470 



475 



Ty 



t riv riv Ser Arg Ala Leu Ser Asn Leu Val Pro Lys Tyr 
Arg Leu Gly Gly ber My ^ ^ 

485 490 

r Gly Gin Gly S.r Glu Ala Cys Thr Cys Asp Ser Gly Asp Tyr Lys 

505 biU 



500 



Leu Ser Leu Ala Gly Arg Arg Lys Lys Leu Phe Lys Lys Lys Tyr Lys 



515 



520 
4 



Ala Ser Tyr 
530 



535 



540 



Asp Gly Arg Val Tyr 



His Val Gly Leu Gly Asp Ala Ala Gin Pro Arg 



545 



550 



555 



560 



fts „ Leu T h r Lys His ,rp Pro CI, M. « Glu - £P 

565 570 

Lys Asp Gly Gly A3P Ph. 8.. Gly Thr Gly Gly Leu Pro Asp Tyr Ser 



580 



Ala Ala Asn 
595 



Pro lie Lys Val Thr His Arg Cys Tyr He Leu Glu Asn 



600 



605 



A s P Thr Val Gin Cys Asp Leu Asp Leu Tyr Lys Ser Leu Gin Ala Trp 

610 61b 



Lys Asp His Lys Leu 



His He Asp His Glu He Glu Thr Leu Gin Asn 



625 



630 



635 



640 



B Lys ,le Lys Asn L.u Ar, Glu Val Ar g Gly His Leu Lys Lys Lys Arc 

ry 64 5 

to 



650 



Pro 



u • c t \7c Tip Ser Tvr His Thr Gin His Lys 
Glu Glu Cys Asp Cys His Lys lie ber lyr 



360 



665 



5 Gly Arg Leu Lys His Arg Gly Ser Ser Leu His 
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B 



675 



680 



670 

Pro Phe Arg Lys Gly 
685 



Leu Gin Glu Lys Asp Lys 
690 



Val Trp Leu Leu Arg Glu Gin Lys Arg Lys 
695 700 



Lys L ys Leu Arg Lys Leu Leu Lys Arg Leu Gin Asn Asn Asp Thr Cys 
705 710 715 

Ser Met Pro Gly Leu Thr Cys Phe Thr His Asp Asn Gin His Trp Gin 

725 730 



Thr 



Ma Pro Phe Trp Thr Leu Gly Pro Phe Cys Ala Cys Thr Ser Ala 
740 745 750 

A sn Asn Asn Thr Tyr Trp Cys Met Arg Thr lie Asn Glu Thr His Asn 

755 760 765 

Phe Leu Phe Cys Glu Phe Ala Thr Gly Phe Leu Glu Tyr Phe Asp Leu 



770 



775 



780 



Asn Thr Asp Pro Tyr Gin Leu Met Asn Ala Val Asn Thr Leu Asp Arg 



785 



790 



795 



Asp val Leu Asn Gin Leu His Val Gin Leu Met Glu Leu Arg Ser Cys 

805 810 

Lys Gly Tyr Lys Gin Cys Asn Pro Arg Thr Arg Asn Met Asp Leu Gly 

820 

Leu Lys Asp Gly Gly Ser Tyr Glu Gin Tyr Arg Gin Phe Gin Arg Arg 
835 840 8 45 

Lys Trp Pro Glu Met Lys Arg Pro Ser Ser Lys Ser Leu Gly Gin Leu 

y ^ --- 860 



850 

Trp Glu Gly Trp Glu Gly 
865 870 



855 



<210> 3 



m <211> 2610 

P <212> DNA 

?y <213> Homo sapiens 

EO 

I 4 iZlclcc cgagcctcgt gctgtgcttg ctgtccgcaa ctgtgttctc cctgctgggt 60 
I* ggalgctcgg ccttcctgtc gcaccaccgc ctgaaaggca ggtttcagag ggaccgcagg 120 
P facatccgcc ccaacatcat cctggtgctg acggacgacc aggatgtgga gctgggttcc 
0 atgcaggtga tgaacaagac ccggcgcatc atggagcagg gcgggacgca cttcatcaac 240 
W gccttcgtga ccacacccat gtgctgcccc tcacgctcct ccatcctcac cggcaagtac 300 
P g ccacaacc acaacaccta caccaacaat gagaactgct cctcgccctc ctggcaggca 
» cagcacgaga gccgcacctt tgccgtgtac ctcaatagca ctggctaccg ^cagctttc 
ttcgggaagt atcttaatga atacaacggc tcctacgtgc cacccggctg gaaggagtgg 
gtcggactcc ttaaaaactc ccgcttttat aactacacgc tgtgtcggaa cggggtgaaa 540 
qagaagcacg gctccgacta ctccaaggat tacctcacag acctcatcac caatgacagc 600 
'g go tct tccgcacgtc caagaagatg tacccgcaca ggccagtcct catggtcatc 
agccatgcag ccccccacgg ccctgaggat tcagccccac aatattcacg cctcttccca 720 
aacgcatctc agcacatcac gccgagctac aactacgcgc ccaacccgga caaacactgg 
atcatgcgct acacggggcc catgaagccc atccacatgg aattcaccaa catgctccag 840 
cggaagcgct tgcagaccct catgtcggtg gacgactcca tggagacgat tta 900 
ctggttgaga cgggcgagct ggacaacacg tacatcgtat acaccgccga ccacggttac 
cacatcggcc agtttggcct ggtgaaaggg aaatccatgc catatgagtt tgaca 0 0 

gtcccgttct acgtgagggg ccccaacgtg gaagccggct gtctgaatcc ccacatcgtc 
ctcaacattg acctggcccc caccatcctg gacattgcag gcctggacat acctgcggat 
atggacggga aatccatcct caagctgctg gacacggagc ggccggtgaa tcggtttcac 200 
ttgaaaaaga agatgagggt ctggcgggac tccttcttgg tggagagagg caagctgcta 1260 
cacaagagag acaatgacaa ggtggacgcc caggaggaga actttctgcc caagtaccag 1320 



6 



C9 t„ .ect**- MT c Z c = c = ca g 13.0 

s= =r: = r= =:: ^ 

^actcttca agaagaagta caaggccagc tatgtccgca g^y 

gccatcgagg tggacggcag ggtgtaccac gtaggcctgg gtgatgccgc ccagcc 1680 

ctcacc! agcggcactg gccaggggcc cctgaggacc aagatgacaa 
gacttcagtg gcactggagg ccttcccgac tactcagccg ccaaccccat taaagtgaca 
catcggtgct acatcctaga gaacgacaca gtccagtgtg acctggacct 192 0 

rib =i :;=: = == = 

:;;= :=:: u - 
i= = =: s : «. 
riri s= = = ::;= = 
=: := zzz - 

» cagtgtaacc cccggactcg aaacatggac ctgggactta aagatggagg «gct^ 
$ caatacaggc agtttcagcg tcgaaagtgg ccagaaatga agagaccttc ttccaaatca 25^ 
J'j ctgggacaac tgtgggaagg ctgggaaggt 



|Q <210> 4 

Sj <400> 4 

% 000 

n 

y <210> 5 

UJ <400> 5 



000 
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000 
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000 
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<210> 11 

<211> 4397 

<212> DNA 

<213> Homo sapiens 



g gcgagagt ^"g.gtg agtgtgcgtc tgtgtgtccc ggcsagggtg ^ egg 
, gccgggagog cggcc.gccg agtccggagg catcgggagg tcgagagccg =c,ggac=cc 120 
t agctctgogt toactgcccc gtccgg.got ggacttcgg, gccggggccg ggg=cgt,=g 80 

Sg^g gcgggcogg g t =g=ggg=c ^ = - ™ 2 3 4 o°o 

m qaaqagggac aagggaaaag aaacaaagcc acagacgcaa cttgagactc ccg 
3 aaagafgcac cagatcagca aaaaaagaag atgggccccc cgagcctcgt gct^tgcttg 
?! ctgtccgcaa ctgtgttctc cctgctgggt ggaagctcgg ccttcctgtc 9 cacc * c ^ ^ 
* ctgaaaggca ggtttcagag ggaccgcagg aacatccgcc ccaacatcat cctggtgctg 480 
10 a racgacc a gatgtgga gctgggttcc atgcaggtga tgaacaagac ccggcgcat 

^-n^at-r^ar accttcqtqa ccacacccat gtgctgcccc ouu 
3l-naaacaaq qcqggacgca cttcatcaac gcotLtyi-ya 

tcacgc cot ccatcctcac cggcaagtac gtccacaacc acaacaccta caccaacaat 
qaqaltgct cctcgccctc ctggcaggca cagcacgaga gccgcacctt tgccgtgtac 720 
W «a ctggLccg gacagctttc ttcgggaagt atcttaatga atac aacggc 
U tcctacgtgc cacccggctg gaaggagtgg gtcggactcc ttaaaaactc ccgcttttat 8 
0 aactacacgc tgtgtcggaa cggggtgaaa gagaagcacg gctccgacta ctccaaggat 
' 1 tacctcacag acctcatcac caatgacagc gtgagcttct tccgcacgtc caagaagatg 960 
tacccgcaca ggccagtcct catggtcatc agccatgcag ccccccacgg ccctgaggat 1020 

c ccccac aatattcacg cctcttccca aacgcatctc agcacatcac gccgag tac 
aactacgcgc ccaacccgga caaacactgg atcatgcgct acacggggcc catgaagccc 1140 
atccaca gg aattcaccaa catgctccag cggaagcgct tgcagaccct catgtcggtg 1200 
qacgactcS tggagacgat ttacaacatg ctggttgaga cgggcgagct ggacaacacg 260 
tacatcg at acaccgccga ccacggttac cacatcggcc agtttggcct ggtgaaaggg 
a atcc g atgc catatgagtt tgacatcagg gtcccgttct acgtgagggg ccccaacgt 
gaagccggct gtctgaatcc ccacatcgtc ctcaacattg acctggcccc caccatcctg 1440 
gacattgcag gcctggacat acctgcggat atggacggga aatccatcct caagctgctg 
gacacggagc ggccggtgaa tcggtttcac ttgaaaaaga agatgagggt ctggcgggac 
tccttcttgg tggagagagg caagctgcta cacaagagag acaatgacaa ggtggacgcc 620 

actttctgcc caagtaccag cgtgtgaagg acctgtgtca gcgtgctgag 
taccagacgg cgtgtgagca gctgggacag aagtggcagt gtgtggagga cgccacgggg 40 
aagctgaagc tgcataagtg caagggcccc atgcggctgg gcggcagcag agccctctcc 00 
aacctcgtgc ccaagtacta cgggcagggc agcgaggcct gcacctgtga cagcggggac I860 



u 



pi i 



5= =" =s = i 

tatgrccgcd yu y «rrtcacca agcggcactg gccaggggcc 2040 

nfaaacctqq qtgatgccgc ccagccccga aacctcacca ay yy ^ 

gtaggcctgg guy y ^ ^ nArttcaqtg gcactggagg ccttcccgac 2100 

cctqaqgacc aagatgacaa ggatggtggg gacttcagtg y ^ ^ 
tactcagccg cclaccccat taaagtgaca catcggtgct acatcctaga gaacgacaca 2160 
gtccagtgtg acctggacct gtacaagtcc ctgcaggcct ggaaagacca caagctgcac 222 
atcgaccacg agattgaaac cctgcagaac aaaattaaga acctgaggga agtccgaggt 2280 
cacSgaaga aaaagcggcc agaagaatgt gactgtcaca aaatcagcta ccacacccag 
cacaaaggcc gcctcaagca cagaggctcc agtctgcatc ctttcaggaa gggcctgcaa 2400 
aaqaaggaca !ggtgtggct gttgcgggag cagaagcgca agaagaaact ccgcaagctg 2460 
ctcaagcgcc tgcagaacaa cgacacgtgc agcatgccag gcctcacgtg cttcacccac 
caaccagc actggcagac ggcgcctttc tggacactgg ggcctttctg tgcctgc.cc 
Igcgccaaca ataacacgta ctggtgcatg aggaccatca atgagactca caatttcctc 2640 
Tctg gaat ttgcaactgg cttcctagag tactttgatc tcaacacaga cccctaccag 2700 
ctgatgLtg cagtgaacac actggacagg gatgtcctca accagctaca cgtacagctc 2760 
atggagctga ggagctgcaa gggttacaag cagtgtaacc cccggactcg aaacatggac 2820 
ctgggactta aagatggagg aagctatgag caatacaggc agtttcagcg tcgaaagtgg 2880 
ccagaaatga agagaccttc ttccaaatca ctgggacaac tgtgggaagg ctgggaaggt 2940 

agaaacaa cagaggtgga cctccaaaaa catagaggca tcacctgact gcacaggcaa 
tgaaaaacca tgtgggtgat ttccagcaga cctgtgctat tggccaggag ^ctgagaaa 3060 
gcaagcacgc actctcagtc aacatgacag attctggagg ataaccagca ggagcagaga 3 20 
taacttcagg aagtccattt ttgcccctgc ttttgctttg gattatacct caccagctgc 3 0 
acaaaatgca ttttttcgta tcaaaaagtc accactaacc ctcccccaga agctcacaaa 3240 
gaaaac a gaga gcgagc gagagagatt tccttggaaa tttctcccaa gggcgaaag 
cattggaatt tttaaatcat aggggaaaag cagtcctgtt ctaaatcctc "attctttt 3360 
ggtttgtcac aaagaaggaa ctaagaagca ggacagaggc aacgtggaga ggctgaaaac 3 20 
fgtgcagaga cgtttgacaa tgagtcagta gcacaaaaga gatgacattt acct.gc.ct 
ataaaccctg gttgcctctg aagaaactgc cttcattgta tatatgtgac tatttacatg 3540 

aa caacat gggaactttt aggggaacct aataagaaat cccaattttc aggagtgg tg 
gtgtcaataa acgctctgtg gccagtgtaa aagaaaatcc ctcgcagttg tggacatttc 
W tgttcctgtc cagataccat ttctcctagt atttctttgt tatgtcccag aactgatgtt 
D tttttttt.a ggtactgaaa agaaatgaag ttgatgtatg tcccaagttt tgatgaaact 3780 
M Ltttgtaa alaaatttt gtagtttaag tattgtcata cagtgttcaa aaccccagc 

aatgaccagc agttggtatg aagaaccttt gacattttgt aaaaggccat "ctttcttg 390 
ggagtttttt ggtgtgtctg tttttttaaa gtattcaaga tactaccagt "acatcttt 3960 
ttggaagaaa atgccttggg tttagaagat tttcttaaaa ggggagtaga tggttgtaga 4020 
ttgactaaaa agtctaccat acttcaaggg actacaggta agtctcatag tataccagct 4080 
ttggtacttc aLtttaaa aaagtattaa tcaattgcaa agaaattcgc cttggccaac 
ccttctttgt gtatcaggta gtctaacctg atacaagtag ttgacagatt tcaactatca 200 

caccagtc caacccattt ctcatttaac agatgacgga gataatccct aaaagcaccc 
acatttgttt caatgcccca aacaggccaa ggctccctag caactcccta gtggcgtttt 4320 
ttaacttctc agaaactgtt accattattt gaaataggct tccttaacct cctttaccct 4380 
taacccaaca gggattt 
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<213> Homo sapiens 



<400> 12 
Asp Ser Arg 



lie Pro Lys Glu Ala Pro Asp Gin Gin Lys Lys Lys Met 
1 5 10 

20 25 
L eu t eu Gly Gly Ser Ser Ala Phe Leu Ser His His Ar, Leu Lys Gly 

35 40 
Aiq Phe Gin Ar g Asp Ara Arc Asn II. Ar, Pro Asn II. II. Leu V,! 

50 " 
Leu Thr Asp Asp Gin Asp Val Glu Leu Gly Ser Met Gin Val Met Asn 
65 70 
n Lys Thr Ar g Arg He Met Glu Gin Gly Gly Thr His Phe He Asn Ala 
S 85 90 

| Phe Val Thr Thr Pro Met Cys Cys Pro Ser Ar g Ser Ser lie Leu Thr 



P 

^ „• n r, Thr- Tvr Thr Asn Asn Glu Asn Cys 

£0 Gly Lys Tyr Val His Asn His Asn Thr Tyr As ^ 

^ 115 120 

I ser Ser Pro Ser Trp Sin Ala Gin His Glu Ser Ar, T h r Ph, Ala Val 

g no 135 

S Tyr Leu ss „ ser Thr Gly Tyr Ar g Thr Ala Phe Phe Gly Lys Tyr Leu 
m 145 150 

cn a riv Ser Tvr Val Pro Pro Gly Trp Lys Glu Trp Val 
Asn Glu Tyr Asn Gly Ser lyr vai rj. ^ 

165 170 

Gly Leu Leu Lys Asn Ser Ar g Phe Tyr Asn Tyr Thr Leu Cys Arg As„ 

180 185 

Gly val Lys Glu Lys His Gly Ser As P Tyr Ser Lys Asp Tyr Leu Thr 
195 200 205 

ASP Leu lie Thr Asn Asp Ser Val Ser Phe Phe Arg Thr Ser Lys Lys 

210 ZL ^ 
mt Tyr Pro His Ar, Pro Val Leu Met Val II. Ser His Ala Ala Pro 



225 



230 



235 
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# 



# 



rln T . yT . c pr Ara Leu Phe Pro Asn 
His Gly Pro Glu Asp Ser Ala Pro Gin Tyr Ser Arg ^ 

245 



Ala 



Ser Gin His lie Thr Pro Ser Tyr Asn Tyr Ala Pro Asn Pro Asp 

260 265 

Lys His Trp U. Met *r 9 T». Thr Cl» Pro Met Lys Pro He His Met 

275 280 
slu Phe Thr »s„ Met Leu Gl„ «, Lys «, Leu Gin Thr Leu Met Ser 



290 



Val Asp Asp 
305 



295 



Ser Met Glu Thr lie Tyr Asn Met Leu Val Glu Thr Gly 



310 



315 



Glu Leu Asp As 



n Thr Tyr lie Val Tyr Thr Ala Asp His Gly Tyr His 



325 



330 



Ue Gly Gin Phe Gly Leu Vai Ly, Gly Lys Ser Met Pro Tyr Glu Phe 



340 



345 



J Asp He Arg Val Pro Phe Tyr 



Val Arg Gly Pro Asn Val Glu Ala Gly 



Pi 

ru 
to 

'■4 



355 

Cys Leu Asn 
370 



360 



365 



Pro His lie Val Leu Asn He Asp Leu Ala Pro Thr lie 



375 



380 



Leu Asp 



lie Ala Gly Leu As P He Pro Ala Asp Met As P Gly Lys Ser 



^ 385 



390 



395 



He Leu Lys Leu Leu Asp Thr 



Glu Arg Pro Val Asn Arg Phe His Leu 



405 



410 



415 



Lys Lys Lys Met «, Val Trp «, Asp Ser Phe Leu Val Glu «r g Gly 

420 425 

Lys Leu Leu His Lys Arg Asp Asn Asp Lys Val Asp Ala Gin Glu Glu 

435 440 445 

As n Phe Leu Pro Lys Tyr Gin Arg Val Lys Asp Leu Cys Gin Arg Ala 



450 



455 



460 



Glu Tyr Gin Thr Ala Cys Glu Gin Leu Gly Gin Lys Trp Gin Cys Val 
465 470 475 

Glu As P Ala Thr Gly Lys Leu Lys Leu His Lys Cys Lys Gly Pro Met 

485 490 
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# 



Arg Leu Gly Gly Ser Arg Ala Leu Ser Asn Leu Val Pro Lys Tyr Tyr 



500 



505 



Thr Cys Asp Ser Gly Asp Tyr Lys Leu 
520 525 

Ser Leu Ala Gly Arg Arg Lys Lys Leu Phe Lys Lys Lys Tyr Lys Ala 



Gly Gin Gly Ser Glu Ala Cys Thr Cy 

coo 525 
515 bZU 



530 



535 



540 



Ser Tyr Val Arg Ser Arg Ser He Arg Ser Val Ala He Glu Val Asp 



545 



550 



555 



Gly Arg Val Ty 



1 Tyr His Val Gly Leu Gly Asp Ala Ala Gin Pro Arg Asn 



575 



565 570 
3U Thr Lys Arg His Tr P Pro Gly Ala Pro Glu Asp Gin Asp As P Lys 



580 



585 



590 



p Asp Gly Gly Asp Phe Ser Gly Thr Gly Gly Leu Pro Asp Tyr Ser Ala 



600 



ifl 595 

5 Ala Asn Pro lie Lys Val Thr His Arg Cys Tyr lie Leu Glu Asn Asp 



615 



620 



□ 610 

ru 

W Thr Val Gin Cys Asp Leu Asp Leu Tyr Lys Ser Leu Gin Ala Trp Lys 



625 



630 



635 



640 



ij Asp His Lys Leu His He Asp His Glu lie Glu Thr Leu Gin Asn Lys 



645 



650 



Leu Arg Glu Val Arg Gly His Leu Lys Lys Lys Arg Pro 

670 



p«j He Lys Asn 

r: 660 665 



Glu Glu Cys Asp Cys His Ly. 

675 680 



His Lys lie Ser Tyr His Thr Gin His Lys Gly 



685 



Arg Leu Lys His Arg Gly Ser Ser Leu His Pro Phe Arg Lys Gly Leu 



690 



695 



700 



Gin Glu Lys Asp Lys Val Trp Leu Leu Arg Glu Gin Lys Arg Lys Lys 
705 710 715 720 

Lys Leu Arg Lys Leu Leu Lys Arg Leu Gin Asn Asn Asp Thr Cys Ser 
725 730 "5 



Met Pro 



Gly Leu Thr Cys Phe Thr His Asp Asn Gin His Trp Gin Thr 



740 



745 



750 
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• 



T riv Pro Phe Cys Ala Cys Thr Ser Ala Asn 
Ala Pro Phe Trp Thr Leu Gly Pro 765 

155 760 

M , Ara Thr He Asn Glu Thr His Asn Phe 
Asn Asn Thr Tyr Trp Cys Met Arg ^ 

Glu Phe Ala Thr Gly «- - ^ *» P ^ ^0 

790 79b 



Leu Phe Cys 
785 

7u a val Asn Thr Leu Asp Arg Asp 
Thr Asp Pro Tyr Gin Leu Met Asn Ala Val ^ 



805 



810 



• w t fin Leu Met Glu Leu Arg Ser Cys Lys 
Val Leu Asn Gin Leu His Val Gin Leu ^ 
820 825 

Gly TyI W . S1 » CVS «n - - - "» »< r> 

835 



840 



Gly ser Tyr S1 „ Gin TV, Ar, - £ «« 



855 



13 Lys Asp Gly 
3 850 

® * p.o Ser Ser Lys Ser Leu Gly Gin Leu Trp 

M T rp Pro Glu Met Lys Arg Pro Ser g?5 880 

P 865 870 

ry 

W Glu Gly Trp Glu Gly 
H 885 



g 
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